Runtime configurable arithmetic and logic cell

ABSTRACT

A cascadable arithmetic and logic unit (ALU) which is configurable in function and interconnection. No decoding of commands is needed during execution of the algorithm. The ALU can be reconfigured at run time without any effect on surrounding ALUs, processing units or data streams. The volume of configuration data is very small, which has positive effects on the space required and the configuration speed. Broadcasting is supported through the internal bus systems in order to distribute large volumes of data rapidly and efficiently. The ALU is equipped with a power-saving mode to shut down power consumption completely. There is also a clock rate divider which makes it possible to operate the ALU at a slower clock rate. Special mechanisms are available for feedback on the internal states to the external controllers.

This application is a continuation of International Patent ApplicationPCT/DE97/02949 filed on Dec. 9, 1997, and a continuation-in-part of U.S.patent application Ser. No. 08/946,810, filed on Oct. 8, 1997.

BACKGROUND INFORMATION

German Patent No. 44 16 881 describes a method of processing data, wherehomogeneously arranged cells which can be configured freely in functionand interconnection are used.

Independently of the above-mentioned patent, field programmable gatearray (FPGA) units are being used to an increasing extent to assemblearithmetic and logic units and data processing systems from a pluralityof logic cells.

Another known method is to assemble data processing systems from fixedprogram-controlled arithmetic and logic units with largely fixedinterconnection, referred to as systolic processors.

Problems Units Described in German Patent No. 44 16 881

Units described in German Patent No. 44 16 881 (referred to below as“VDUs”) are very complicated to configure owing to the large number oflogic cells. To control one logic cell, several control bits must bespecified in a static memory (SRAM). There is one SRAM address for eachlogic cell. The number of SRAM cells to be configured is very large,thus, a great deal of space and time is needed for configuring andreconfiguring such a unit. The great amount of space required isproblematical because the processing power of a VPU increases with anincrease in the number of cells, and the area of a unit that can be usedis limited by chip manufacturing technologies. The price of a chipincreases approximately proportionally to the square of the chip area.It is impossible to broadcast data to multiple receivers simultaneouslybecause of the repeated next-neighbor interconnection architecture. IfVPUs are to be. reconfigured on site, it is absolutely essential toachieve short reconfiguration times. However, the large volume ofconfiguration data required to reconfigure a chip stands in the way ofthis. There is no possibility of separating cells from the power supplyor having them cycle more slowly to minimize the power loss.

FPGAs

FPGAs for the use in the area described here usually includemultiplexers or look-up table (LUT) architectures. SRAM cells are usedfor implementation. Because of the plurality of small SRAM cells, theyare very complicated to configure. Large volumes of data are required,necessitating a comparably large amount of time for configuration andreconfiguration. SRAM cells take up a great deal of space, and theusable area of a unit is limited by the chip manufacturing technologies.Here again, the price increases approximately proportionally to thesquare of the chip area. SRAM-based technology is slower than directlyintegrated logic due to the SRAM access time. Although many FPGAs arebased on bus architectures, there is no possibility of broadcasting forrapid and effective transmission of data to multiple receiverssimultaneously. If FPGAs are to be reconfigured at run time, it isabsolutely essential to achieve short configuration times. However, thelarge volume of configuration data required stands in the way. FPGAs donot offer any support for reasonable reconfiguration at run time. Theprogrammer must-ensure that the process takes place properly withoutinterfering effects on data and surrounding logic. There is nointelligent logic to minimize power loss. There are no special functionunits to permit feedback on the internal operating states to the logiccontrolling the FPGA.

Systolic Processors

Reconfiguration is completely eliminated with systolic processors, butthese processors are not flexible because of their rigid internalarchitecture. Commands are decoded anew in each cycle. As describedabove, there are no functions which include broadcasting or efficientminimization of power loss.

SUMMARY

The present invention relates to a cascadable arithmetic and logic unit(ALU) which is configurable in function and interconnection. No decodingof commands is needed during execution of the algorithm. It can bereconfigured at run time without any effect on surrounding ALUs,processing units or data streams. The volume of configuration data isvery small, which has positive effects on the space required and theconfiguration speed. Broadcasting is supported through the internal bussystems in order to distribute large volumes of data rapidly andefficiently. The ALU is equipped with a power-saving mode to shut downpower consumption completely. There is also a clock rate divider whichmakes it possible to operate the ALU at a slower clock rate. Specialmechanisms are available for feedback on the internal states to theexternal controllers.

The present invention is directed to the architecture of a cell asdescribed in, for example, German Patent No. 44 16 881, or, or example,conventional FPGA cells. An expanded arithmetic and logic unit (EALU)with special extra functions is integrated into this cell to perform thedata processing. The EALU is configured by a function register whichgreatly reduces the volume of data required for configuration. The cellcan be cascaded freely over a bus system, the EALU being decoupled fromthe bus system over input and output registers. The output registers areconnected to the input of the EALU to permit serial operations. A buscontrol unit is responsible for the connection to the bus, which itconnects according to the bus register. The unit is designed so thatdistribution of data to multiple receivers (broadcasting) is possible. Asynchronization circuit controls the data exchange between multiplecells over the bus system. The EALU, the synchronization circuit, thebus control unit and registers are designed so that a cell can bereconfigured on site independently of the cells surrounding it. Apower-saving mode which shuts down the cell can be configured throughthe function register; clock rate dividers which reduce the workingfrequency can also be set.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows an example arrangement of several PAEs forming one PA withPLU. Without connection to input/output systems or memories.

FIG. 2 shows an example architecture of a PAE.

FIG. 3 shows an example architecture of F-PLUREG and M-PLUREG.

FIGS. 4 a-4 c shows an example architecture of an O-REG.

FIGS. 5 a-5 c shows an example architecture of an O-REGsft with a rightshift function.

FIGS. 6 a-6 c shows an example architecture of an R-REGsft withright/left 1-2 bit barrel shifter.

FIGS. 7 a-7 b shows an example architecture of the R20-MUX andimplementation of a MUX in transfer gate technology.

FIG. 8 shows example clock synchronization, delays and sync signals.

FIGS. 9/1-9/2 shows the functioning of an example sync UNIT.

FIG. 10 shows an example architecture of a power UNIT.

FIG. 11 shows an example architecture of a sync UNIT.

FIG. 12 shows an example architecture of a BM UNIT.

FIG. 13 shows an example architecture of an O-MUX, limited to four bussystems.

FIG. 14 shows an example architecture of an R GATE, limited to four bussystems.

FIG. 15 shows an example architecture of the state-back UNIT.

FIG. 16 shows an exemplary operating principle of the one-shot MODE andthe one-shot/power-save MODE.

FIG. 17 shows an example of implementation of a PAE.

FIG. 18 shows an example architecture of a PAE, with the individualfunctions being linked via a bus system.

FIG. 19 shows an example operation of the config state machine.

FIG. 20 shows an example-operation of a loop state machine.

FIG. 21 shows an example cyclical processing of the configurationregister data.

DETAILED DESCRIPTION

The present invention relates to the design of a cell (e.g., processingarray element or “PAE”) as described in German Patent No 44 16 881, or,for example, conventional FPGA cells, where the PAEs can be cascaded toform an array (e.g., a processing array or “PA”). One PAE is composed ofa plurality of function units.

EALU

The computing unit includes an expanded arithmetic and logic unit (EALU)permanently implemented in the logic unit. An EALU is an ordinary knownarithmetic and logic unit (ALU) which has been expanded by specialfunctions such as counters. This EALU is capable of performing aplurality of arithmetic and logic operations; which do not have to bespecified here exactly, because it is possible to refer to known ALUs.The EALU has direct access to its own results (described below) whichare returned as the operand. Thus counters or serial operations such asserial multiplication, division or series expansion are possible. Inaddition to its result, the EALU delivers the signals CarryOut-AlessBand AequalB-0detect. CarryOut-AlessB specifies either the carry-over inarithmetic operations, or in comparisons by means of subtraction of twovalues, it specifies the carry-over, i.e., CarryOut-AlessB, that A <B orB<A, depending on the operand negated. The signal is the usualcarry-over generated by a full adder. AequalB-0detect specifies that theresult in the result register R-REGsft is zero. The signal is generatedby a NOR element from the result. The signals are used for simple statusanalysis and can be returned to the PLU. Additional status signals canbe implemented, depending on the application.

The function of the EALU is configured in a function register(F-PLUREG).

O-REG

The input operands of the EALU are stored in two independent operandregisters (O-REG). The input operands are thus available independentlyof the status of the unit supplying the data (data transmitter). Thispermits decoupling from the bus and for the PAs to be freelyreconfigurable. One or both O-REGs have a shift function which istriggered by the EALU, for each O-REG individually, if so required. Theshift function makes it possible to perform serial operations such asserial multiplication or division in the EALU. O-REGsft denotes O-REGswith a shift function.

R-REGsft

The result of the EALU is stored in a result register (R-REGsft). Thisprovides time independence of the unit or units receiving the result(data receivers). The R-REGsft has a shift function which is triggeredby the EALU, thus permitting serial operations.

R20 MUX

The result data available in R-REGsft is introduced as an operandthrough a multiplexer (R20-MUX) between one of the O-REGs and the EALUto guarantee feedback of results for serial operations, counters andsimilar functions. The multiplexer is set by the F-PLUREG.

Clock Cycles

It is appropriate but not absolutely necessary to trigger the O-REG sftat a clock pulse edge and the R-REGsft at the subsequent negated clockpulse edge. Thus, the EALU has a half clock pulse available to carry outits function; the second half clock pulse is available for signalpropagation times and multiplexers. Thus, it is possible to perform acomplete operation in each clock pulse.

StateMachine, SM Unit

An SM UNIT is available for sequence control in the EALU. The SM UNITcontrols the O-REG and R-REGsft and their shift function, as well ascontrolling the R20-MUX. Consequently, serial operations and shift andcounting functions can be performed easily by the EALU. The statemachine is easy to implement by the conventional methods.

Sync UNIT

A synchronization unit (Sync UNIT) is provided for synchronization ofone PAE in an array (PA) of PAEs. The Sync UNIT analyzes a series ofinput signals, which execute a handshake protocol.

-   rACK(h/l): The data receiver acknowledges the data received, rACKh    being the acknowledgment of the high result byte (bits 8 through 15)    and rACKl being the acknowledgment of the low result byte (bits 0    through 7). The two are linked with an AND (rACKh AND rACKl) and    yield the signal rACK. rACK is not true while one or both data    receivers are busy processing their data and becomes true when the    processing of the data of both data receivers is concluded, and the    result is stored in the R-REGsft of the respective data receiver.    The rACK(h/l) signals are often considered below in their AND-linked    form as rACK (=rACKh & rACKl).-   oRDY (1/2): The data transmitter signals its readiness to send new    data. oRDY is not true while the data transmitter is busy processing    its data, and it becomes true when the result of the data    transmitter, which is the operand of the PAE, is available. oRDY1 is    the enable signal of the data transmitter of the first operand, and    oRDY2 is that of the second. The two are linked with an AND (oRDY1    AND oRDY2) and yield the oRDY signal. oRDY is true only when both    data transmitters are ready to send data. The oRDY(1/2) signals are    often considered below in their AND-linked form as oRDY (=oRDY1 &    oRDY2).

Output signals generated from the input signals and the status of thesync UNIT which, together with the sequence control of the EALU,represent the overall status of the PAE; those output signals are inturn regarded as input signals by the sync UNITs of the datatransmitters and receivers. Status information and the F-PLUREG registerare used for sequence control of the EALU.

-   rRDY: Indicates that the PAE has finished its data processing and a    result is available in R-REGsft. rRDY is transmitted as rRDYh and    rRDYl to both data receivers. However, it is the same signal in each    case.-   oACK: Indicates that the PAE has processed its operands and can    receive new data in O-REG(sft). oACK is transmitted as oACK1 and    oACK2 to both data transmitters. However, it is the same signal in    each case.

The RDY signals retain their level up to the input of the acknowledgmentthrough ACK. This is necessary when the data receiver is reconfiguredwhile the data are being made available. If RDY is applied untilacknowledged through ACK, the data receiver will recognize after thereconfiguration that data is available and will accept it.

The linking of the signals over multiple PAEs is as follows:

Data transmitter PAE Data receiver rRDY → oRDY rRDY → oRDY rACK ← oACKrACK ← oACK

This means that the output signal rRDY of the data transmitter, forexample, represents the input signal oRDY1 or oRDY2 of PAE. The outputsignal rRDY of PAE is the input signal oRDY of the data receiver.

The sync UNIT has the following types of sequences:

Mode Description Comments Wait OP The PAE waits for Only if no operandsmultiple-cycle operation is taking place Cycle 1 A single-cycle Operandsare operation is acknowledged being carried out Cycle n One cycle of a —multiple-cycle operation is being carried out Cycle Z The last cycle ofOperands are a multiple-cycle acknowledged operation is being carriedout Wait ERG The PAE waits for Only if a acknowledgment of precedingresult the result exists Stop Stops execution — after conclusion of thecurrent cycle, then acknowledges ReConfig if the result has also beenacknowledged

The sync UNIT makes available a special mode which enables the clocksignal only when operands are available. This mode is appropriateespecially when the data transmitters do not supply the data in eachclock pulse of the processor but instead only in every n^(th) clockpulse. Then the clock cycle corresponds to one period of the normalclock, cycle and is enabled through rACK or oRDY(1/2). The enable isreferred to as OneShot. This mode is called the OneShot MODE. The clockpulse is AND-linked with one of the enable signals through an AND gate.The mode and signal selection take place through F-PLUREG. The enablesignal generated through rACK or oRDY(1/2) can be lengthened by the SMUNIT. This is necessary so that operations which need more than onecycle can be carried out in one-shot MODE. To make this possible, acorresponding signal line of the SM UNIT is OR-linked to the enablesignal.

If the registry entry STOP is set in. F-PLUREG, the sync UNIT runs thecurrent function to the end. Thereafter, no more operands are acceptedor acknowledged. As soon as rACK indicates that the result has beenaccepted by the data receiver, the readiness of the PLU forreconfiguration is indicated by the ReConfig signal. The signal isgenerated when rACK stores the stop of F-PLUREG in a D flip-flop.ReConfig can be polled by read access of the PLU to F-PLUREG at the stopbit position.

Likewise, the sync UNIT can be used to generate and analyze error statesor other status signals.

BM UNIT

To apply the operands and the result to the external bus systems, thereis a bus multiplex unit (BM UNIT). The BM UNIT has two multiplexers andtwo gates, with the two multiplexers being for the operands (O-MUX) andthe two gates for the result (R-GATE), one switch being for thehigher-order result and one for the low-order result. The multiplexersand switches are controlled over the multiplexer register (M-PLUREG).The sync UNIT signals are controlled over the switches to the bus. Thecorrelation of the multiplexers/switches and signals is as follows:

O-MUX1:oRDY1, oACKO-MUX2:oRDY2, oACKRH-GATE:rRDY, rACKhRL-GATE:rRDY, rACKl

The R-GATE can be brought by M-PLUREG to a state in which it does notdrive a bus system.

The table gives a description of the signals and their respectiveinterface architecture:

Signal oRDY oACK rRDY rACK D7-0 Indicates operands operands resultresult data ready acknowledged ready acknowledged Type input open driverinput bidirectional collector

It is possible to operate several data receivers from one datatransmitter (broadcasting). To do so, several data receivers areconnected to the same bus. To guarantee acknowledgment of the data, thedriver stage of the acknowledgment line oACK is designed as an opencollector driver. The bus operates as wired AND, i.e., only when alldata receivers have acknowledged the data does the H level required forthe acknowledgment occur. This is accomplished by the fact that thisdata receiver which does NOT acknowledge the data pulls the bus to an Llevel over an open collector transistor. Data receivers whichacknowledge the data do not trigger the open collector transistor andthus add no load to the bus. When all the data receivers haveacknowledged the data, there is no more load on the bus and it assumesan H level via a pull-up resistor.

State-Back UNIT

The PAE is capable of supplying feedback about its operating state toits primary logic unit, hereinafter called PLU (see DE 44 16 881 A1).The primary logic unit configures the PA and requires information aboutthe status of the individual PAEs in order to be able to performappropriate reconfigurations. This is done through the StateBack UNIT.This transmits either the lower 3-bit of the result from the R-REGsft—todeliver calculated values to the PLU—the CarryOut-AlessB andAequalB-0detect signals to a 3-bit status bus, depending on the entry inthe F-PLUREG. To allow signals to be impressed from several PAEs, asimple wired-OR method over open collector drivers is used. In order forthe reconfiguration of the PAE to begin only when the receiver hasacknowledged the data, a latch unit can be inserted between the signalsand the open collector drivers to enable the signals only after receiptof rACK. The status bus is monitored by the PLU, which reacts by itsprogram flow and reconfiguration to the status supplied by the bus.

Power UNIT

The PAE has a power-saving mode (Sleep MODE) which, like the function ofthe EALU, is set in F-PLUREG. There is a bit for this which, when set,starts the sleep MODE. To do so, either the clock line of the PAE is setat a constant logic 0 or 1, or the voltage of the PAE is turned off viaa transistor. F-PLUREG is always energized within the PAE and cannot beswitched off. Unused areas (gates) of the PAE for the function executedare switched off by analysis of the F-PLUREG. This is done through atransistor which isolates those areas from the power supply. To preventunwanted interference, the outputs of the areas are defined overpull-up/pull-down resistors.

In addition, the power save MODE can also be used within the one-shotMODE which is controlled by the sync UNIT. All parts of the PAE exceptthe F-PLUREG, M-PLUREG and sync UNIT are isolated from the power supply.Only when the sync UNIT senses a one-shot mode are all the required PAEparts connected over the power UNIT. The sync UNIT delays the clocksignal until all the newly connected parts are capable of functioning.

Registers

The F-PLUREG and M-PLUREG registers are connected to the PLU bus. Theaddresses of the data packages sent by the PLU are decoded in acomparator. If the address of the PAE is recognized, the data is storedin the registers. The PLU bus has the following architecture:

-   AX7-0: X address of the X/Y matrix-   AY7-0: Y address of the X/Y matrix-   RS: Register select; logic 0 selects F-PLUREG, logic 1 selects    M-PLUREG-   AEN: Address enable; the bus contains a final address. The addresses    must be decoded as long as AEN is logic 0. AEN is logic 0 during the    entire bus access, i.e., also during the data transmission.-   D23-00: Data-   DEN: Data enable; the bus contains final data. The data must be    transferred to the register at the rising edge of the DEN signal.-   OEN: Output enable; the PLU reads valid data out LS of the PLUREGs.

Architecture of F-PLUREG:

1. The architecture of the register in write access:

F11 F10-9 F8 F8 F5 F4-0 Stop StateBack Sleep OneShot PowerSave EALUfunction

The functions of the individual bits:

Function State Effect Stop 0 Normal function 1 Stop functions aftercurrent cycle, no acknowledgment of the operands StateBack 00 Mofeedback, bus open 01 D2-0 to bus 10 CarryOut-AlessB, AequalB-0detect tobus 11 Illegal Sleep 0 No function, de-energized 1 Normal function,energized OneShot 00 Normal function 01 OneShot to oRDY1 10 OneShot toORDY2 11 OneShot to (rACKh & rACKl) PowerSave 0 No PowerSave 1Power-saving mode in combination with one-shot mode EALU 00000 Nooperation (NOP) function 00001 . . . Function according to design of theEALU 11111

The reset state is 0 in all bits.

2. Register architecture in read access:

F11 ReConfig

The functions of the individual bits:

Function State Effect ReConfig 0 No reconfiguration possible 1Reconfiguration possible

The reset state is 0 in all bits.

Architecture of M-PLUREG:

M23-18 M17-12 M11-06 M05-00 High-order Low-order Operand 1 Operand 0result result

The values M(n+5)−.n=000000, nε{0,6,12,18 } mean that themultiplexers/switches are open and have no bus contact. It may beappropriate to block M-PLUREG via the ReConfig signal, i.e., as soon asReConfig is activated, PAE disconnects from all bus systems. The resetstatus is 0 in all bits.

SUMMARY

The function of PAE can be configured and reconfigured more easily andquickly than in known technologies, in particular the FPGA technology.The interconnection of the ALU is specified in M-PLUREG, whereas in thetraditional technologies a plurality of individual unrelatedconfiguration bits must be occupied. The transparent architecture of theregisters simplifies (re)configuration.

Due to the direct design as an ALU, the PAE requires less space than intraditional technologies where ALUs have been implemented by a pluralityof logic cells. At the same time, the run delays are lower and theattainable clock frequencies are higher accordingly.

A broadcasting function is guaranteed due to the design of the BM UNIT,and acknowledgment is automatic. A data transfer that is largelyindependent in time is guaranteed due to the input and outputregisters(O-REG, R-REG).

Configuration and reconfiguration are greatly simplified due to thedecoupling of each PAE from the overall system through the O-REG andR—REG registers since the individual PAEs are largely independent of oneanother. Feedback to the PLU as well as the interaction of STOP andReConfig permit efficient control of (re)configuration.

Power-saving functions have been implemented which lead to a reductionin power consumption—automatically to some extent (one-shot MODE).

The PAE architecture can be implemented in the FPGAs to increase theirefficiency. Arithmetic operations can thus be implemented much moreefficiently.

DESCRIPTION OF THE FIGURES

Referring now to the figures, FIG. 1 shows a simplified processoraccording to German Patent 44 16 881 A1. The PLU (0101) and its bussystem (0102) are shown. The PAEs (0103) are shown as an array, and theinternal bus system (0104) in the chip is indicated schematically.

FIG. 2 shows the schematic architecture of a PAE. The internal bussystem (0201) within the chip is connected to the BM UNIT (0202) whichconnects the buses selected by M-REG (0203) to O-REG1sft (0204) asoperand 1 and O-REG (0205) as operand 2. The result available in resultregister R-REGsft (0207) is optionally introduced into the data path ofoperand 2 over R20-MUX (0206). The data from O-REGsft (0204) and R20-MUX(0206) are processed in the EALU (0208). Feedback goes to the PLU overthe state-back UNIT (0209). The PLU bus (0210) is connected to theF-PLUREG (0211) and M-PLUREG (0212) registers and to the state-back UNIT(0209), and the PAE is configured and monitored over it. F-PLUREGcontains all functional configuration data, M-PLUREG contains theinterconnection information of the PAE. The sync UNIT (0212) controlsthe interaction of the data exchange between the data transmitters, thedata receivers and the processing RAE. The SM UNIT (0213) controls theentire internal sequence of the PAE. The power UNIT (0214) regulates thepower supply and serves to reduce power consumption.

FIG. 3 illustrates the functioning of the M-PLUREG and F-PLUREGregisters. The AX and AY addresses of the PLU bus (0308) are comparedwith the address of PAE in a comparator (0301) if AEN (address enable)indicates a valid bus transfer. Each PAE has a unique address composedof its line and column within a PA. If DEN (data enable) indicates datatransmission, then either M-PLUREG (0302) or F-PLUREG (0303) is selectedover RS (register select). The data are stored in the respectiveregister at the rising edge of DEN. The registers are implemented as Dflip-flops (0304). Timing diagram 0305 illustrates the sequence. TheReConfig signal is sent from the sync UNIT to the PLU bus only over thegate (0306) for read access to the F-PLUREG. Enable is based on theresult of the comparator (0301) AND the OEN signal.

FIG. 4 a shows a block diagram of O-REG. FIG. 4b shows how the O-REG isconstructed of D flip-flops. FIG. 4 cshows the timing diagram. The clocksignal is generated by SYNC-SM.

FIG. 5 a shows a block diagram of O-REGsft. FIG. 5 bshows how O-REGsftis constructed of D flip-flops (0501). The AND gates (0502) and OR gates(0503) form, via the inverter (0504), a mode-controlled multiplexer(0506) which either switches the input data to the D flip-flops (0501)or sends the output data of the D flip-flops, shifted by one bit, totheir inputs. The AND gate (0505) is not necessary, because one input ispermanently at logic 0. It is shown only for the purpose ofillustration. FIG. 5 c shows the timing diagram as a function of thesignal mode. The clock pulse is generated by SYNC-SM.

FIG. 6 a shows the block architecture of R-REGsft. Upstream from theregister (0601) there is a multiplexer (0602) which either switches theinput data to the register (0601) or directs the shifted output data ofthe register (0601) to its input. The clock pulse generated by SYNC-SMis sent to the register, shifted by one half clock pulse. FIG. 6 b showsthe block diagram on a gate level. Modes 0-2 switch a multiplexer (0606)consisting of AND gates with a downstream OR gate via a decoder (0603).Gates shown with dotted lines (0605, etc.) are shown only for the sakeof illustration. They are functionless, because one input is constantlyat L. The multiplexer switches the input signal to the registers (0607)in modes 0-2=010. In modes 0-2=000 to modes 0-2=001, the output valuesof the registers (0607) are shifted by one bit to the left, and in modes0-2=011 to modes 0-2=100 they are shifted by two bits to the right andsent to the inputs of the registers. The timing diagram in FIG. 6 cillustrates the shift function, depending on modes 0-2.

FIG. 7 a shows the architecture of multiplexer R20-MUX which sends theoperands or the result to EALU as a function of mode. FIG. 7 a shows atraditional multiplexer design, while FIG. 7 b shows the space-savingand power-saving variant due to the use of CMOS transfer gates (0701).All the multiplexers described in this document can be constructed withtransfer gates.

A gate may be composed of transfer gates in a design equivalent to thatof the multiplexer. However, the direction of data exchange is exactlythe opposite!

FIG. 8 shows the relationship of PAE-internal clock CLK to theactivities taking place. At the rising edge (0801), operands are storedin the O-REG. During the H level (0802), PAE processes the data(ΔPAE=processing phase). This includes the data traffic between. O-REGand R-REG. At the trailing edge (0803) the result is stored in R-REG.The L level (0804) is used for distribution of the BM UNIT enclosed inthe bus system (Δnetwork=bus phase). The time sequence of signals (oRDYand oACK, rRDY and rACK) generated by SYNC-SM is shown in the timediagram.

FIG. 9 shows the sequence diagram of the sync UNIT. The state machinerecognizes two fixed states, DATA (0901) and RESULT (0902). DATA issynchronized at the rising edge, RESULT at the trailing edge. The statusof the input parameters is analyzed and, depending on the result, themachine jumps to the YES branch (0903/0904) or the NO branch(0905/0906). If the operands are not ready in DATA, the machine jumps toNO. No operation is carried out in the next steps until the machinejumps back according to DATA and performs an analysis again. Ifoperands, indicated by oRDY, are ready, the operands are stored in O-REG(0907). The operands are processed (0908) and analyzed (0909) at thesame time to ascertain whether it is the last cycle in multiple-cycleoperations (serial operations requiring more than one clock cycle) orwhether a single-cycle operation is taking place. In these cases, theoperands are acknowledged by oACK (0910). RESULT is synchronized withthe trailing edge, and a check is performed to determine whether the“result available” flag has been set (0911). This flag is always setwhen a finished result is signaled by rRDY (0912).

In two cases, the system jumps to the YES branch (0904):

1. There is no preceding result (“result available” flag is false).2. There is a preceding result (“result available” flag is true) and itis acknowledged with rACK. In this case (and only in this case(!)) 0902resets the result (0913).

Otherwise the system jumps to the NO branch (0906) and no operation iscarried out until the state machine returns after RESULT (0902). In theYES branch (0904) the result is stored in output register R-REGsft(0914). It is then determined whether this is the last cycle of amultiple-cycle operation (0915) (cf. 0909) or a single-cycle operation;if yes, the presence of the result is signaled by rRDY (0916). The statemachine returns after DATA (0901). The recognition of whether it is thelast cycle of an operation—or a single-cycle operation—can be queried ofthe SM UNIT via the FINISH signal (0916). This is active when the last-or single-cycle takes place. The status of the sync UNIT is signaled tothe SM UNIT via RUN (0917). RUN is active in the event an operation istaking place; otherwise it is inactive. The mechanism of the STOP entryin F-PLUREG and the ReConfig generated from this are not shown in FIG. 9because the sequence is trivial and is apparent from the description ofthe sync UNIT.

FIG. 10 shows the basic architecture of the power UNIT. The sleep signalis sent from F-PLUREG to a transistor or a transistor stage (1001) whichcontrols the power supply for all cell functions that can be switchedoff. The sync UNIT delivers the one-shot power-save signal (see FIG. 16)which enables the power supply of the remaining cell functions via atransistor or transistor stage (1002). Depending on the functionsactually used in the cell, the transistors or transistor stages (1003)shut down the unneeded functions (power down). It is understandable thatother similar means such as capacitors, etc., must be provided forproper power supply and EMC behavior.

FIG. 11 shows the implementation of the machine from FIG. 9 in theconcept. The oRDY(1/2) and rACK signals are switched over the BM UNIT(1101) (in simplified terms—there is actually rACKh and RACKl,rACK=rACKl & rACKh) to the CCLK-controlled latches (1102). The latchesare switched in such a way that they are transparent in the L phase (busphase) of CCLK and hold their status in the H phase (processing phase).The outputs of the latches make available the signals for the sync statemachine (1103). rRDY (in simplified terms: there are actually rRDYh andrRDYl—they are completely identical but are sent to different receivers)from 1103 is switched to the bus over a gate. The oACK(1/2) signals from1103 are negated in the BM UNIT (1101) and sent to the inverting opencollector bus driver (1104). The bus is pulled to H over resistors(1105). The BM UNIT is switched so that the following cases occur:

1. If the corresponding bus is not driven by the BM UNIT, L is at thebase of the transistors (1104). Therefore, they place no load on thebus.2. If the corresponding bus is driven by the BM UNIT and the signal isnot acknowledged, the base of the transistors (1104) is at H. This meansthat the bus is pulled to L. If a result is distributed to multiple datareceivers by broadcasting, then all PAEs that have not yet acknowledgedthe result data and need waiting cycles pull the bus to L.3. If the corresponding bus is driven by the BM UNIT and the signal isacknowledged, the base of the transistors (1104) is at L. This meansthat the bus is not placed under load. If a result is distributed tomultiple data receivers by broadcasting, all PAEs which haveacknowledged the result data and do not need waiting cycles place noload on the bus.

In its initial state, the bus assumes the H level, i.e., acknowledgment,so non-acknowledgment according to case 2 overrides the acknowledgmentby pulling the bus to L. The bus goes to the H level, i.e., theacknowledgment state, only when all PAEs acknowledge the result data.Thus, a wired-AND circuit is implemented. The sync state machinesupplies the RUN signal (1107) to the SM UNIT (1106) which is running onthe basis of RUN. If the SM UNIT is in the last—or only processingcycle—it signals this to the sync state machine via FINISH (1108).FINISH is analyzed in the analysis units to recognize the last cycle(0907, 0915). The SM UNIT runs in sync with the PAE-internal clock CLK.

FIG. 12 shows the architecture of the BM UNIT. Depending on the entriesinto M-PLUREG, the multiplexers (1201, 1202) switch the operands fromthe internal bus (1203) to O-REG. Likewise, the gates (1204, 1205)switch the upper and lower halves of the result to the bus. Multiplexer1206 switches oRDY(1/2) according to the position of 1201 and 1202 andswitches rACK according to the position of 1204 and 1205 from the bus tothe PAE. The rACK signals of the two data receivers are linked togetherby AND. If there is only one data receiver, the multiplexer is switchedso that it returns a logic 1 instead of the missing rACK. 1207 containsa gate for switching the oACK(1/2) and rRDY signals to the bus. TheoACK(1/2) signals are first inverted and then switched to the bus overthe open collector driver (1104).

FIG. 13 illustrates the architecture of an O-MUX. There is a 3:5 decoder(1301) for analyzing mode 2-0 signals from M-PLUREG. The multiplexer isconstructed with AND gates (1302) and a downstream OR gate (1303). Theanalysis signal of mode 2-0=000 of the decoder (1301) is switcheddirectly to the OR gate (1304). This causes logic 1 to always bereturned in the open state, i.e., no connection to a bus system (seerACK in FIG. 12). Only a reduced bus size is shown for the sake ofsimplicity.

FIG. 14 shows the architecture of an R GATE. There is a 3:4 decoder(1401) for analyzing Mode 2-0 signals from M-PLUREG. The analysis signalof mode 2-0=000 of the decoder is not used. Therefore, no bus connectionis established with this bit combination. The gates (1402) are composedeither of AND gates or transmission gates (see 0701). An amplifier stagefor driving the bus load is connected upstream or downstream. Only areduced bus size is shown for the sake of simplicity.

FIG. 15 shows the state-back UNIT. Depending on the setting in M-PLUREG,a multiplexer (1501) switches through either the signalsCarryOut-AlessB, AequalB-0detect from the EALU or the outputs of R-REGR-REGD2-0. The signals go to an open collector transistor stage (1502)and are switched to the PLU bus. The PLU bus needs external pull-upresistors (1503) positioned near the PLU. Latch 1504 is optional. If itis inserted into the output signals of 1501, they are switched to thebus (1503) only after the data receiver has acknowledged the data viarACK. Consequently, the readiness for reconfiguration is indicated bythe status signals only when the data have been accepted. This isnormally regulated by the interaction of STOP and ReConfig in the syncUNIT. Therefore, the latch is optional. The rACK signal is used as thelatch clock pulse. The latch is transparent at rACK=1 and saved atrACK=0.

FIG. 16 illustrates the operation of the one-shot MODE.

The signals

1. Vcc

2. oRDY13. oRDY24. (oRDY1 & rACK)enable the cell clock via a multiplexer (1601) depending on the settingin F-PLUREG. The effect of the enable via Vcc is that the clock isalways running (see “normal operation” in the timing diagram).

In the three remaining modes, the clock does not run until the signalsor signal combinations enable the clock. The enable is synchronized toclock CCLK by a latch (1602), so the phase is not aborted prematurelywhen the enable signal is too short. The latch is transparent in the Lphase of CCLK and holds its value in the H phase. The enable signal goesto the AND gate pair (1603 and 1604) which enables the clock. Invertedclock signal !CLK is generated by an inverter (1605); CLK runs through atime-delay element (1606) to guarantee phase coincidence (see “one-shotoperation” in the timing diagram). CCLK is delayed over two delay pulses(1610) in the incoming line to 1604 to guarantee phase coincidence withCCLK which is applied to 1603 and has been delayed by the multiplexer(1608). If a PAE is brought into the power-save mode, the power supplyof the cell is mostly shut down. This is done via OR gate 1611. If thepower-save MODE has been activated, i.e., power save=1, the negatedsignal carries an L level. If one-shot MODE has also been switched onand register 1602 is at L, the power supply transistor in the power UNIT(see FIG. 17) is also switched off via the one-shot power-save signal.However, if register 1602 is at logic 1 (or power save=0), the powersupply transistor is switched on via 1611. The following table gives anoverview of the function:

Power Latch save (1602) Voltage Comments L X on H L off only whenone-shot MODE has been set! H H on

When switching on the power supply, there is a wake-up time of fixedduration until the cell is capable of functioning. To neverthelessfunction properly, the signals must be delayed accordingly. Therefore,CCLK is sent over a delay line (1607). A multiplexer (1608) selectsaccording to the power-save signal from F-PLUREG whether the normal ordelayed clock pulse is sent to the cell. Only the uninverted clock pulseis delayed by the Δpower-on period; the inverted clock pulse is notdelayed. Therefore, the result is available in synchronization with theremaining functions of the unit. Thus, the usable processing time isreduced to Δprocess. The maximum clock frequency of the unit thusdepends on Δpower-on+cprocess (see “one-shot operation with power save”in the timing diagram).

FIG. 17 shows an example of implementation of a PAE. The BM unit, thepower unit, the StateBack unit, the PLU bus and the M-PLUREG are notshown.

The PAE has three input registers oREG1 (1701), oREG2 (1702), oREG3(1703) for the data to be processed. The input registers receive theirdata from preceding PAEs via the BM unit. All the input registers areregisters without a shift function.

The F-PLUREGs (1704, 1705, 1706) determine a plurality of configurationsof the PAEs. They are loaded by the PLU over the PLU bus. Aconfiguration selected via the multiplexer (1723) is stored in eachF-PLUREG. The multiplexer (1723) is controlled by the register (1724).The register (1724) receives data or triggers from a preceding PAEthrough the BM unit. This PAE is not identical to the PAEs which supplythe data for the input registers. Of course, a larger or smaller numberof F-PLUREGs is also conceivable.

The third input register oREG3 (1703) supplies the operand for amultiply-add function, where the contents of the oREG1 (1701) aremultiplied by the contents of the oREG2 (1702) in the multiplier (1709)and then the contents of the oREG3 (1703) are added in theadder/comparator (1718). The adder/comparator is L configured so that itperforms an addition. If only multiplication is to be performed, theoREG3 (1703) is loaded with the value zero. If only addition is to beperformed, the F-PLUREG switches the multiplexer (1711). Thus, the valueof the oREG1 (1701) goes directly to the adder/comparator (1718). Thesecond function of the adder/comparator (1718) takes the value of theoREG1 (1701) and the value of oREG3 (1703) and compares the two values.The output signals CarryOut-AlessB and AequalB-0detect (1719) indicatewhether the two values are the same or the value of oREG3 (1703) isgreater or less than the value of oREG1 (1701).

Additional functions implemented in the PAE include a shift register(1712), a divider (1713), logic functions (1714) and AND, OR, NOT and acounter (1715). The counter (1715) generates a trigger signal (1720) assoon as it has counted from a predetermined value to zero. The counter(1715) is loaded directly with the value also received by oREG3 (1703).It is also conceivable to use other counters, such as incrementalcounters which count from zero to a loaded value and then generate atrigger signal.

The results of the function units are relayed by the multiplexer (1716)to the two output registers rREG1 (1710) and rREG2 (1702) which areconnected to the BM unit and thus relay the data to the downstream PAEs.The sequence is controlled by a sync unit (1708) connected to thetrigger logic (1707). In addition, it exchanges control signals with theconfig state machine, guaranteeing the correct sequence when theconfiguration is changed by the multiplexer (1711). The trigger logic(1707) is connected to the F-PLUREG and analyzes the incoming signals(1722) in accordance with the configuration stored in the F-PLUREGs. Theincoming signals include the ReConfig, the general trigger signal aswell as the handshake signals oRDY and rACK. Depending on theconfiguration, the trigger logic (1707) delivers the handshake signalsto the sync unit (1708), which in turn generates the enable signals forthe input and output registers and the counter. In addition, the syncunit (1708) generates outgoing handshake signals oACK and rRDY, relayingthem to the trigger logic (1707).

Depending on the configuration, the signals (1719) or the trigger of thecounter (1720) can be used as a general trigger signal and sent to thetrigger logic (1707). The signals (1721) ReConfig, handshake signalsoRDY and rACK and the general trigger signal are outgoing from thetrigger logic (1707) and sent to the BM unit.

FIG. 18 shows a PAE with the same extent of functions as the PAE in FIG.17. Here again, the BM unit, the power unit, the StateBack unit and theM-PLUREG are not shown. It has three input registers oREG1 (1801),,oREG2 (1802), oREG3 (1803), two output registers rREG1 (1804), rREG2(1818), three F-PLUREGs (1813, 1814, 1815), a multiplexer (1817), onesync unit (1805) and one trigger logic (1806). The function unitsinclude a divider (1807), a multiplier (1817), an adder/comparator(1809), logic functions (1810), a shift register (1811) and a counter(1812). The function of the individual units corresponds to thatdescribed with regard to FIG. 17. It is also conceivable to haveadditional functions integrated into the PAE, such as the trigonometricfunctions, root and exponential functions. This is of course also trueof the PAE described with regard to FIG. 17. The individual functionscan be implemented as integers or as floating point units. In contrastwith the PAE in FIG. 17, the individual function units are linked by abus system (1816), so that the individual functions can beinterconnected in any sequence. The wiring is determined by theconfiguration stored in the F-PLUREGs. The bus system (1816) may havedifferent designs. Possibilities include one or more segmented buseswhose segments connect two functions that are wired together or a numberof continuous buses wiring two function units together. In addition,there is the possibility of the individual functions and registerssending a target address with the help of which a connection isestablished.

FIG. 19 shows a config state machine which manages the configurationregisters. At the start, the config state machine is in the IDLE state(1901). This state is not left until after a rRDY signal has beenreceived from the config PAE selecting the configuration register. Thenthe config state machine enters the stop state (1902) and transmits astop signal to the sync unit of the PAE. The PAE terminates itsoperation at the next possible time, sending a stop acknowledge signalto the config state machine. The latter goes into the reload state(1903) and transmits an enable signal to the register of the config RAE.Then the config state machine changes into the restart state (1904),sends a start signal to the sync unit, which resumes its processing. Atthe same time, it sends an rACK signal to the config PAE. Finally, theconfig state machine returns to the IDLE state (1901).

FIG. 20 shows a state machine for automatically running through thevarious configurations of the PAE, which is referred to below as a loopstate machine. Since there are several F-PLUREGs, it is appropriateunder some circumstances to perform multiple operations in successionand only then relay the data as well as the trigger signals andhandshake signals to the downstream PAE.

The result of the individual operations is returned from the rREG of thePAE to the input registers via the feedback described above. Thisprocess can be controlled by an external stop, loop signal or aninternal counter.

At the beginning, the loop state machine is in the IDLE state (2001). Inthe IDLE state (2001) the loop state machine delivers a reset signal toa counter. This counter is used to dial the F-PLUREG. After the rRDY ofthe preceding PAE, the loop state machine enters the configuration state(2002), where it generates the handshake signals for the PAE and thecontrol signals for the config state machine. In addition, an enablesignal is generated for the counter, whose value is incremented by one.If the loop state machine then does not receive any stop loop signal orif the internal counter of the PAE has not yet reached its final value,it remains in the configuration state (2002), and the sequence describedabove is repeated. On arrival of a stop loop signal or when the internalcounter of the PAE has reached its final value, the state machinereturns to the IDLE state (2001) and the rRDY signal is relayed to thenext PAE.

FIG. 21 illustrates the part of the PAE needed for the sequentialrun-through of the configurations stored in the F-PLUREG. The F-PLUREGs(2104) receive their data from the PLU (2107) and are slightly modifiedin comparison with the embodiments described previously. Each F-PLUREGcontains an additional bit, the loop bit (2106). This loop bit isreturned to the loop state machine (2102) described with regard to FIG.20 through the link (2112). It functions there as a stop loop signal,i.e., with the loop bit (2106) set, the loop sequence is terminated;otherwise, it is continued until the loop stop bit is set, or theinternal. counter of the loop state machine (2101) described inconjunction with FIG. 20 has reached its final value. The loop statemachine (2101) controls the counter (2102) whose value controls theselection of the F-PLUREG (2104) by the multiplexer (2105). The controlsignals of the config state machine and the handshake signals of the PAEare transmitted through the link (2113). The configuration data of theF-PLUREG is relayed over the link (2108) to the function units of thePAE.

The counter (2102) receives an enable signal (2110) so that the value ofthe counter (2102) is incremented. In addition, the loop state machine(2101) sends a reset signal (2111) to the counter as soon as a loopoperation is concluded. A choice can be made between loop mode and thenormal process in the PAE by way of the multiplexer (2103). In a normalsequence, the multiplexer (2103) relays a data (2109) for selection ofan F-PLUREG (2104) to the multiplexer (2105).

DEFINITION OF TERMS

-   AequalB-0detect: Signal generated by the EALU, indicating in    arithmetic operations that the result is equal to zero. In    comparisons, it indicates that operand A is equal to operand B.-   ALU: Arithmetic and logic unit. Basic unit for processing data. The    unit can perform arithmetic operations such as addition, subtraction    or under some circumstances also multiplication, division, series    expansions, etc. The unit may be designed as an integer unit or as a    floating point unit. It may also perform logic operations such as    AND, OR and comparisons.-   BM UNIT: Unit for connecting the data to the bus systems outside the    PAE. The connection takes place via multiplexers for the data inputs    and gates for the data outputs. oACK lines are implemented as open    collector drivers. The BM UNIT is controlled by M-PLUREG.-   Broadcast: Transmitting data from a PAE to multiple data receivers.-   CarryOut-AlessB: Signal generated by the EALU which indicates a    carry-over in arithmetic operations. In comparisons, it indicates    that operand A is smaller than operand B.-   Data receiver: The unit(s) which processes further the results of    the PAE.-   Data transmitter: The unit(s) which makes available the data for the    PAE as operands.-   D flip-flop: Storage element that stores a signal at the rising edge    of a clock pulse.-   EALU: Expanded arithmetic and logic unit. An ALU expanded to add    special functions which are needed or appropriate for the operation    of a data processing system according to German Patent No. 44    16 881. These are counters in particular.-   FPGA: Known field-programmable gate array.-   F-PLUREG: Register in which the function of the PAE is set. The    one-shot and sleep modes are also set here. The PLU writes into the    register.-   Gate: Group of transistors that perform a basic logic function.    Basic functions include, for example, NAND, NOR. Transmission gates.-   H level: Logic 1 level, depending on the technology used,-   Handshake: Signal protocol in which one signal A indicates a state    and another signal B confirms that it accepts signal A and responds    (has responded) to it.-   Configuring: Determining the function and interconnecting a logic    unit, a (FPGA) cell or a PAE (see reconfiguring).-   Latch: Storage element which normally relays a signal transparently    during the H level and stores it during the L level. Latches where    the function of the levels is exactly reversed are occasionally used    in PAEs, in which case an inverter is connected upstream from the    clock of a conventional latch.-   L level: Logic 0 level, depending on the technology used.-   M-PLUREG: Register in which the interconnection of the PAE is set.    The PLU writes into the register.-   Next-neighbor interconnection: Interconnection of bus systems with    the adjacent neighbors abutting at the edges.-   O-MUX: Multiplexer within the BM UNIT which selects the bus system    of the operands.-   One shot: Mode in which a PAE operates at a lower clock rate than    the processor clock rate. The clock rate is synchronous with the    processor clock rate and corresponds to one period. There is no    phase shift. The clock pulse is enabled by one of the signals    oRDY(1/2) or rRDY. This mode serves to save power when the data    transmitters or receivers are transmitting or receiving data at a    slower rate than the processor clock rate.-   Open collector: Circuitry in which a bus signal pulled to the H    level via a pull-up is applied to the collector of a transistor. The    emitter of the transistor is grounded. If the transistor switches,    the bus signal is pulled to the L level. The advantage of this    method is that a plurality of such transistors can control the bus    without electrical collision. The signals are OR linked, resulting    in wired-OR.-   O-REG: Operand register for storing the operands of the EALU. It    makes it possible for the PAE to be independent of the data    transmitters in time and function. This simplifies data transfer    because it can be asynchronous or packet-oriented. At the same time,    the possibility of reconfiguring the data transmitters independently    of the PAEs or reconfiguring the PAEs independently of the data    transmitters is created.-   O-REGsft: O-REG with a shift register controlled by the SM UNIT.-   PA: Processing array: array of PAEs.-   PAE Processing array element: EALU with O-REG, R-REG, R20-MUX,    F-PLUREG, M-PLUREG, BM UNIT, SM UNIT, sync UNIT, state-back UNIT and    power UNIT.-   PLU: Unit for configuring and reconfiguring the PAW. Configured by a    microcontroller adapted specifically to its task.-   Power-save MODE: Power-saving mode within the one-shot MODE. None of    the parts of the PAE except the F-PLUREG, M-PLUREG and sync unit are    supplied with voltage when no operation is being carried out.-   Power Unit: Unit which regulates the power-saving functions.-   Pull-down: Resistor which pulls a bus line to an L level.-   Pull-up: Resistor which pulls a bus line to an H level.-   R GATE: Switch within the BM UNIT which switches the result to the    corresponding bus system. Some signals are switched over open    collector drivers. The R GATE works as a bus driver and can enter a    bus-neutral mode.-   R20-MUX: Multiplexer for inserting the result in an R-REGsft into    the data path between O-REG and EALU.-   R-REGsft: Result register for storing the result of the EALU. It    makes it possible for the PAE to be independent, in time and    function, of the data receivers. This simplifies data transfer    because it can be asynchronous or packet-oriented. At the same time,    this creates the possibility of reconfiguring the data receivers    independently of the PAE or reconfiguring the PAE independently of    the data receivers. The register is provided with a shift function    which is controlled by the SM UNIT.-   Serial operations: Operations performed by serial processing of a    data word or an algorithm. Serial multiplication, serial division,    series expansion.-   Sleep MODE: Power-saving mode in which the PAE, except for F-PLUREG,    carries no voltage.-   SM UNIT: State machine UNIT. State machine which controls the EALU.-   StateBack UNIT: Unit which controls the feedback of the status    signals to the PLU. Consists of a multiplexer and an open collector    bus driver stage.-   Sync UNIT: Unit responsible for the synchronization of the PAE with    the data transmitters and receivers, which also monitors the    reconfiguration of PAEs. At the same time, it assumes the one-shot    functions.-   Gate: Switch that forwards or blocks a signal. Simple comparison:    relay.-   Reconfiguring: New configuration of any number of PAEs while any    remaining PAEs continue with the same function (see configuring).-   State machine: Logic which can assume various states. The    transitions between the states depend on various input, parameters.    These are known machines that are used to control complex functions.

Conventions

Naming conventions

Component: UNIT

Operating mode: MODE

Multiplexer: MUX

Negated signal: notVisible register for PLU: PLUREGInternal register: REGShift registers: sftFunction conventionsShift registers: sftAND function: &

A B Q 0 0 0 0 1 0 1 0 0 1 1 1OR function: #

A B Q 0 0 0 0 1 1 1 0 1 1 1 1NOT function: !

A Q 0 1 1 0GATE function: G

EN D Q 0 0 — 0 1 — 1 0 0 1 1 1

1-3. (canceled)
 4. An integrated circuit comprising: a plurality ofprocessing cells interconnected by a network; wherein: at least some ofthe cells have a configurable arithmetic logic unit (ALU); and at leastsome of the cells are configurable, perform at least one of arithmeticand logic data processing, and have at least one register, a clocksupply of the at least one register being suppressible.
 5. Theintegrated circuit according to claim 4, wherein the suppressibility ofthe clock supply depends on acceptability, by a receiver, of output dataof the at least some of the cells that perform the at least one of thearithmetic and logic processing.
 6. The integrated circuit according toclaim 4, wherein at least some of the processing cells are runtimeconfigurable.
 7. The integrated circuit according to claim 4, whereinthe at least some of the cells having the configurable ALU have at leastone input register.
 8. The integrated circuit according to claim 4,wherein the at least some of the cells having a configurable ALU have atleast one output register.
 9. The integrated circuit according to claim4, wherein the clock supply is suppressible for power saving.
 10. Theintegrated circuit according to claim 4, wherein the suppressibility ofthe clock supply depends on availability of input data to the at leastsome of the cells that perform the at least one of the arithmetic andlogic processing.
 11. The integrated circuit according to claim 10,wherein each of the at least some of the cells having the configurableALU has at least 3 inputs.
 12. The integrated circuit according to claim11, wherein the network is configurable.
 13. The integrated circuitaccording to claim 12, wherein the integrated circuit is a FieldProgrammable Gate Array (FPGA) integrated circuit.
 14. An integratedcircuit comprising: a plurality of processing cells interconnected by anetwork; wherein: at least some of the cells have a configurablearithmetic logic unit (ALU); and at least some of the cells areconfigurable, perform at least one of arithmetic and logic dataprocessing, and have at least one register, a clock supply to the atleast one register being configurably suppressible.
 15. The integratedcircuit according to claim 14, wherein at least some of the processingcells are runtime configurable.
 16. The integrated circuit according toclaim 14, wherein each of the at least some of the cells having theconfigurable ALU has at least 3 inputs.
 17. The integrated circuitaccording to claim 16, wherein the at least some of the cells having theconfigurable ALU have at least one input register.
 18. The integratedcircuit according to claim 16, wherein the at least some of the cellshaving the configurable ALU have at least one output register.
 19. Theintegrated circuit according to claim 18, wherein the integrated circuitis a Field Programmable Gate Array integrated circuit (FPGA).
 20. Theintegrated circuit according to claim 14, wherein the network isconfigurable.
 21. The integrated circuit according to claim 14, whereinthe clock supply is suppressible for power saving.
 22. An integratedcircuit comprising: a plurality of processing cells interconnected by anetwork; wherein: at least some of the cells have a configurablearithmetic logic unit (ALU); and at least some of the cells areconfigurable, perform at least one of arithmetic and logic dataprocessing, and have at least one register, an arrangement beingprovided with the at least one register for disabling a clock supply ofthe at least one register.
 23. The integrated circuit according to claim22, wherein the disablement of the clock supply depends on availabilityof input data to the at least some of the cells that perform the atleast one of the arithmetic and logic processing.
 24. The integratedcircuit according to claim 22, wherein the disablement of the clocksupply depends on acceptability, by a receiver, of output data of the atleast some of the cells that perform the at least one of the arithmeticand logic processing.
 25. The integrated circuit according to claim 22,wherein each of the at least some of the cells having a configurable ALUhas at least 3 inputs.
 26. The integrated circuit according to claim 25,wherein each of the at least some of the cells having a configurable ALUhas at least one input register.
 27. The integrated circuit according toclaim 26, wherein at least some of the processing cells are runtimeconfigurable.
 28. The integrated circuit according to claim 22, whereineach of the at least some of the cells having a configurable ALU has atleast one output register.
 29. The integrated circuit according to claim22, wherein the network is configurable.
 30. The integrated circuitaccording to claim 22, wherein the disablement of the clock supplyprovides power saving.
 31. The integrated circuit according to claim 22,wherein the integrated circuit is a Field Programmable Gate Array (FPGA)integrated circuit.